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We present a search for WW and WZ production in final states that contain a charged 
lepton (electron or muon) and at least two jets, produced in y^s = 1.96 TeV pp collisions 
at the Fermilab Tevatron, using data corresponding to 1.2 fb _1 of integrated luminosity 
collected with the CDF II detector. Diboson production in this decay channel has yet to 
be observed at hadron colliders due to the large single W plus jets background. An artifi- 
cial neural network has been developed to increase signal sensitivity, as compared with an 
event selection based on conventional cuts. We set a 95% confidence level upper limit of 
aww x BR(W — > Ivi, W — > jets) + awz x BR(W — > £vt, Z — ► jets) < 2.88 pb, which is consistent 
with the standard model next to leading order cross section calculation for this decay channel of 
2.09 ±0.12 pb. 
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In the standard model of particle physics (SM), the 
weak bosons (W, Z) and the photon are the gauge bosons 
of the local SU(2) x U(l) symmetry. The sponta- 
neous breaking of this symmetry gives masses to the W 
and Z bosons, while the gauge symmetry itself defines 
the interactions among these heavy bosons and the pho- 
ton. Since the electroweak sector of the SM relies on 
this mechanism, it is of prime importance to test the bo- 
son couplings experimentally. We present in this paper a 
search for WW and WZ production in the charged lep- 
ton (electron or muon), neutrino plus jets decay channel 
[I]. Figure [I] shows the leading order diagrams for the 
pp — > W{— > &/)V(— > jets) process, where V = W,Z. 
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FIG. 1: Leading-order diagrams for WW and WZ production 
in the semi-leptonic decay channel; there are similar diagrams 
for the charge conjugates of the processes shown. 



The production of WW and WZ could be more sen- 
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sitive to the triple gauge couplings (TGC) WW(Z/j), 
present in the s-channel (Fig. [T^, b), and would 
be enhanced by the presence of nonstandard couplings 
(anomalous TGC) [2J. The hadronically decaying W 
(W — > jets) cannot be differentiated from hadronically 
decaying Z (Z — > jets) due to the limited jet energy res- 
olution of the detector [3]. We therefore search for the 
combined WW and WZ production. 

The next to leading order (NLO) SM cross sections 
times branching ratio for these modes are <tww x 
BR{W -> lv t ,W -> jets) = 1.81 ± 0.12 pb and 
a wz x BR{W -> iv t , Z -> jets) = 0.28 ± 0.02 pb 0J[S]. 

The DO collaboration recently reported the first evi- 
dence for the WW and WZ production in lepton plus 
jets decay mode [6]. This decay mode has not been ob- 
served yet at hadron colliders due to the large W plus 
jets background. The cross section for W plus two or 
more jets production at \fs = 1-96 TeV is at least two 
orders of magnitude larger than the total cross section 
times branching ratio of the signal [7] , which translates 
into an expected signal to background ratio of less than 
1%. Given that this diboson production is topologically 
similar to the associated production of Higgs and W 
bosons, techniques that are developed for the WW and 
WZ searches are of key importance for Higgs searches. 
The lepton plus jets final state is common in other inter- 
esting processes, such as top production; thus, diboson 
production decaying in this channel is a significant back- 
ground to these processes, and vice-versa. 

The search for WW and WZ production is performed 
using data corresponding to 1.2 fb _1 of integrated lumi- 
nosity collected with the CDF II detector from pp colli- 
sions at y/s = 1.96 TeV at the Fermilab Tevatron. The 
CDF II detector is a general-purpose, multilayered detec- 
tor designed to study many aspects of particle physics. It 
combines precise tracking systems with calorimeters and 
muon detectors [5] ■ A tracking system is positioned clos- 
est to the beamline to provide accurate momentum de- 
termination of charged particles. The tracking system is 
immersed in a 1.4 T uniform magnetic field, produced by 
a superconducting solenoid and aligned along the proton 
direction. Calorimeters located outside the tracking vol- 
ume provide energy measurement of electrons, photons, 
and jets. The geometrical coverage of the calorimeters 
is maximized to measure the energy flow of all particles 
produced in a collision and indirectly detect the neutri- 
nos by the presence of missing transverse energy J£ T [9] . 
Muon chambers are located on the outer part of the CDF 
II detector. 

The selection of the signal events proceeds as follows. 
The trigger system selects events with leptons of central 
pseudorapidity |?7| < 1; electron candidates with trans- 
verse energy Et > 18 GeV or muon candidates with 
transverse momentum pt > 18 GeV/c. Events that are 
reconstructed offline are required to contain one electron 
candidate with Et > 25 GeV or one muon candidate 
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with pt > 25 GeV/c. The sample is enriched in events 
containing a neutrino by requiring that the _J? T , corrected 
for the calorimeter energy leakage and the presence of 
muons, satisfies Tfi T > 25 GeV. 

The jets are reconstructed in the calorimeter using 
the jetclu cone algorithm [12! with cone radius R = 
y/ A(j) 2 + Ar/ 1 = 0.4. Starting from seed locations cor- 
responding to calorimeter towers with Et > 1 GeV, all 
nearby towers with Et > 0.1 GeV are used to search 
for stable cones. To resolve ambiguities due to overlaps, 
cones sharing an energy fraction greater than 0.75 are 
merged into a single jet. The measured energy deposi- 
tion in the detector is corrected for effects that distort 
the true jet energy [3]. Such effects include the non- 
linear response of the calorimeter to the particle energy, 
uninstrumented regions of the detector, spectator inter- 
actions, and energy radiated outside the cone. We select 
events that contain two or more jets with E T et > 15 GeV. 
To enhance the signal selection, events are rejected if the 
difference in pseudorapidity between the two leading jets, 
A?7(Jetl, Jet2), is greater than 2.5. 

The WW and WZ production in the lepton plus 
jets event signature is simulated using pythia v6.3 [TU] 
Monte Carlo generator, followed by GEANT-based [TTJ 
CDF detector simulation. We search for diboson produc- 
tion in the region of [45,160] GeV/c 2 in the dijet invariant 
mass, that is constructed taking the two leading jets into 
account. Using the signal Monte Carlo description, we 
define a signal region of [60,100] GeV/c 2 . It contains 
approximately 80% of the reconstructed hadronically de- 
caying W bosons. Outside of the signal region we define 
a lower sideband region of [45,60] GeV/c 2 and a higher 
sideband region of [100,160] GeV/c 2 . We enhance the 
W event selection by rejecting events if the transverse 
mass Mt of the lepton and $ T system is not within the 
interval 30 GeV/c 2 < M T < 120 GeV/c 2 . 

The most significant background to the WW and WZ 
search in the lepton plus jets decay channel consists of W 
plus jets events where the leptonically decaying W boson 
is produced in association with jets that mimic a hadron- 
ically decaying W or Z. The W plus jets background is 
simulated using the ALPGEN vl.3 [13] Monte Carlo gen- 
erator, followed by pythia Monte Carlo generator for 
the parton shower and fragmentation, and full GEANT 
detector simulation. Other, less significant backgrounds 
originate from a tau lepton that is detected as an electron 
or a muon; events with large transverse energy due to the 
Drell-Yan process, where one of the two leptons is not re- 
constructed; and QCD events with a jet misidentified as 
a lepton. The QCD background is derived from the data, 
while the other background processes are simulated using 
pythia Monte Carlo events. 

Based on the Monte Carlo simulation, in the dijet in- 
variant mass region of [45,160] GeV/c 2 we predict 716 sig- 
nal (S) events and 29,093 background (B) events. The 
estimated signal fraction (S/(S + B)) is small (0.024). 



The statistical significance (S/ y/(S + B)) is equal to 4.1. 
In order to increase the sensitivity to WW and WZ in 
the lepton plus jets final state, more sophisticated tech- 
niques beyond event counting are required. Correlations 
between kinematic quantities are exploited using a feed- 
forward artificial neural network (ANN) [T4] , 

A feed-forward ANN can be thought of as a single- 
valued function of input vectors. The function has many 
parameters, the values of which determine the output for 
a given input vector. Usually the output is a continuous 
distribution in the range to 1. The training of the 
network is equivalent to a minimization procedure. The 
aim is to reduce the error function, which is the sum of 
the squared deviations of the neural network output from 
the desired output for signal (usually 1) and background 
(usually 0). When the trained network with its optimised 
parameters is used with real events, the network output 
for each event is used to define if the event is selected or 
not [IS]. 

The ANN we developed for this analysis is trained us- 
ing six input variables that can discriminate the signal 
from the background. The output of the ANN is a vari- 
able where the signal and background are well separated. 
We perform the ANN training using angles and event 
shape quantities and ensure that cutting on the output 
of this ANN does not introduce significant bias on the 
signal and background dijet invariant mass shapes. 

The quantities used in the ANN training are shown 
in Fig. [2] We used the difference between the pseu- 
dorapidity 77 of the leading jets (A?i(Jetl, Jet2)); the 
maximum value of the pseudorapidity 77 of the two jets 
(max 7?(Jetl, Jet2)); the fraction J2Pt/J2p 2 > wnere the 
sum is over all objects, leptons and jets (for the neu- 
trino the |? T is used in the denominator); the fraction 
J2pt/J2p 2 > where the sum is over the two leading jets; 
and finally the quantities A# 12 = 0j e ti ~ &jet2 an d 
AOdijet.i, both calculated in the rest frame of the dijet 
system. The quantity A9dij e t,i is given by the expression 
A6dijet,i = \0Jeti - 0dijet\, if Ojeti • dijet > 0; otherwise, 
it is given by A9 dijet ,i = |tt- 9 Jetl - 9 dijet \, where 9 Jet i 
and 9dijet are calculated in the rest frame of the dijet 
system. 

The training is performed using the variables in the 
signal region only. Both signal and background descrip- 
tions are given by Monte Carlo simulations. The ANN 
has been trained for the electron and the muon channels 
combined. 

The ANN output is shown in Fig. [3] A cut is applied 
at the point where the statistical significance is maxi- 
mized. After applying this cut in Monte Carlo simulated 
events, we estimate within the dijet invariant mass re- 
gion of [45,160] GeV/c 2 an expected number of 554 sig- 
nal events and 14,481 background events. The signal 
fraction is 0.037, improved by 53% with respect to the 
value before the ANN cut was applied. The statistical 
significance is 4.5, an improvement of about 10%. 
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FIG. 2: Neural Network input variables. The ANN is trained 
with events in the signal region only and for electrons and 
muons simultaneously. Both signal and background descrip- 
tions are given by Monte Carlo simulated events. 



A comparison between the data and the Monte Carlo 
simulated events of the ANN output shape in the side- 
band regions is shown in Fig. [4] For the ANN output in 
the sidebands, the data are well described by the Monte 
Carlo simulation. 

We measure the signal fraction in the data by perform- 
ing a likelihood fit on the dijet invariant mass distribu- 
tion. The shape of the dijet invariant mass is parameter- 
ized for the events that pass the cut in the ANN output. 
To obtain the signal parameterization, we use PYTHIA 
Monte Carlo simulated events. The background model is 
derived from the Monte Carlo simulation and is fit with 
the form PDF BGR oc exp (ax + f3x 2 ) (PDF = Probabil- 
ity Density Function), with a and /3 as free parameters. 
The overall parameterization consists of the signal and 
background descriptions, with the signal fraction fg be- 
ing one additional free parameter. A likelihood function 
C is constructed using this parameterization with a total 
of three free parameters a, 0, and fg; a fit is performed 
to the data. The fit returns the parameters a and (3 as 
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FIG. 3: ANN Output, interpreted as a function associating 
the output, from 0.0 to 1.0, to each event. The distributions 
for signal and background samples are shown. A cut is ap- 
plied at the value 0.46. This is the value where the statistical 
significance is maximized, in the context of this specific ANN 
output. 
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FIG. 4: Comparison of the experimental data with Monte 
Carlo simulation in the dijet invariant mass sidebands for the 
ANN output. The \ 2 probability the agreement between the 
experimental data and the Monte Carlo simulation is w 30%. 



well as the signal fraction, which is then converted into 
the number of events. 

The signal fraction measured from the data over a di- 
jet invariant mass region of [45,160] GeV/c 2 , and for the 
events that pass the ANN cut, is f s = 0.027 ± 0.014. 
Given a total of 15,016 events, this signal fraction cor- 
responds to 410 ± 213 signal events. The uncertainty is 
statistical, obtained from the fit, and accounts for the 
poisson fluctuations of the total number of events mea- 
sured on the data. 

The likelihood fit from the data is shown in Fig. [5] The 
overall fit result (signal plus background) and the mea- 



7 



sured background shape are displayed. Figure [6] shows 
the signal shape that is measured on the data, obtained 
by subtracting the background from the data. The signal 
shape is compared to the expected signal shape, normal- 
ized according to the measured signal fraction. 
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FIG. 5: Likehood fit on data (solid line). The dashed line 
shows the background estimation, as given by the data. The 
inset provides a close-up in the signal region. 
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FIG. 6: The signal shape measured on data. The plot is made 
by subtracting the measured background shape from the data. 
The errors are statistical only. The solid line corresponds 
to the Monte Carlo simulated events shape for the measured 
signal fraction. There is a good agreement between the Monte 
Carlo events shape and the shape seen in the data. 

The significance of the result is evaluated using the 
likelihood ratio, Q = £ S+B (a, (3, / s )/£ B (a,/3), as a test 
statistic. We test the SM signal plus background hypoth- 
esis and the background-only hypothesis by analyzing a 



set of simulated experiments, as done in data. We per- 
form a fit for the three free parameters, and each time cal- 
culate the likelihood ratio. Using Monte Carlo simulated 
experiments, we estimate a ~ 2.5cr statistical significance 
for the expected signal given its SM cross section. From 
the data, we measure a 1.9a statistical significance. The 
data are thus compatible with SM expectations, and we 
estimate an upper limit of the WW and WZ cross sec- 
tion. 

The cross section times branching ratio that corre- 
sponds to the measured number of signal events is es- 
timated using the formula a x BR — N S i gna i/a ■ e • Jzf, 
where N S i gna i is the measured number of signal events; 
a is the signal acceptance, derived from the Monte Carlo 
simulated events; e is the global efficiency factor that in- 
cludes vertex, tracking and trigger efficiencies; and is 
the total integrated luminosity of the data we used. The 
product of the acceptance times efficiency is estimated 
separately for electrons and muons. 

To assess the effects of systematic uncertainties on the 
measurement, we address separately two kinds of system- 
atic uncertainties: those that affect the signal fraction, 
and those that affect the acceptance. The signal fraction 
uncertainties define the uncertainty in the significance of 
the measurement. 

The dominant systematic uncertainty in the signal 
fraction come from the background shape parameteriza- 
tion. The background shape is fit to the form PDF bgr <x 
exp {ax + (3x 2 ), as already described, which has two pa- 
rameters a and /3. This form gives an adequate fit to 
both the Monte Carlo simulation and the data. In order 
to quantify the size of the systematic uncertainty asso- 
ciated with the background shape, fits with additional 
parameters in the exponent (from three to six) were car- 
ried out. The variations obtained were used to assign the 
systematic uncertainty. 

Other systematic uncertainties in the signal fraction 
include those originating from the energy scale of the 
jets (JES). The effect in the signal fraction is quantified 
by varying the parameters of the signal shape to account 
for the ±1(7 variations of the JES. We generate simu- 
lated experiments with the new parameterizations and 
fit them with the standard signal parameterization. The 
difference in the signal fraction from the different fits de- 
termines the systematic uncertainty. 

The result also depends on the dijet invariant mass 
resolution which in turn depends on the jet energy res- 
olution. To estimate the systematic uncertainty due to 
this effect, we introduce an additional Gaussian smearing 
relative to the jet energy Other systematic uncertainties 
that affect the signal shape but have smaller effects on 
the result are the initial and final state radiation effects. 

A summary of all systematic uncertainties on the signal 
fraction and their effect on the measurement is given in 
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Table [TJ The total systematic uncertainty on the signal 
fraction is estimated to be 25%. 

The systematic uncertainties affecting the acceptance 
are evaluated by counting the number of events that pass 
the selection cuts, after varying the various uncertainty 
sources. The sources that have been taken into account, 
as well as the actual effect on the acceptance are listed 
in Table HI 

The overall uncertainty on the cross section is given by 
taking into account the uncertainties in the signal frac- 
tion, the acceptance and the luminosity. The total effect 
is estimated to be 26%. The total uncertainty in the 
measurement is given by the statistical and systematic 
uncertainties added in quadrature. Taking into account 
the systematic uncertainties, the significance of the mea- 
surement is 1.7 'a. 



Source 


Signal Fraction 


Jet Energy Scale 


10% 


Jet Resolution 


10% 


Background Shape 


20% 


Initial State Radiation 


5% 


Final State Radiation 


< 1% 


Source 


Acceptance 


Jet Energy Scale 


3% 


Jet Resolution 


< 1% 


Initial State Radiation 


2% 


Final State Radiation 


3% 


Efficiency Factor 


3% 


Source 


Cross Section 


Total Signal Fraction 


25% 


Total Acceptance 


5% 


Luminosity 


6% 


Total Effect in Cross Section 


26% 



TABLE I: The systematic uncertainties and their effect in the 
signal fraction, the acceptance and finally the cross section. 
All uncertainties are added in quadrature. 



Taking into account both the statistical and systematic 
error in the number of signal events, we measure ctww x 
BR{W -> £v e ,W -> jets) + cr W z x BR(W -> tu t , Z -> 
jets) = 1.47±0.77(stat)±0.38(sys) pb, which is consistent 
with the SM theoretical prediction for the cross section, 
2.09 ± 0.12 pb. We set a 95% confidence level (C.L.) 
upper limit for the measured cross section. Given that 
the uncertainties follow a Gaussian distribution, the 95% 
C.L. limit can be set by the estimated value plus 1.65 
standard deviations [TB]. The 95% C.L. upper limit set 
for the cross section is a x BR < 2.88 pb. 

In summary, we have used 1.2 fb^ 1 of CDF II data 
to search for WW and WZ production in the lepton 



plus jets final state. We use an ANN to discriminate the 
signal from the background. This technique improves 
the expected statistical significance by w 10% and the 
expected signal fraction by 50%, as compared with an 
event selection based on conventional cuts. We find no 
evidence for anomalous WW and WZ production in the 
lepton plus jets final state and we set a 95% C.L. upper 
limit for the cross section at a x BR < 2.88 pb. 

We thank the Fermilab staff and the technical staffs 
of the participating institutions for their vital contribu- 
tions. This work was supported by the U.S. Department 
of Energy and National Science Foundation; the Italian 
Istituto Nazionale di Fisica Nucleare; the Ministry of 
Education, Culture, Sports, Science and Technology of 
Japan; the Natural Sciences and Engineering Research 
Council of Canada; the National Science Council of the 
Republic of China; the Swiss National Science Founda- 
tion; the A. P. Sloan Foundation; the Bundesministcrium 
fur Bildung und Forschung, Germany; the Korean Sci- 
ence and Engineering Foundation and the Korean Re- 
search Foundation; the Science and Technology Facilities 
Council and the Royal Society, UK; the Institut National 
de Physique Nucleaire et Physique des Particules/CNRS; 
the Russian Foundation for Basic Research; the Ministe- 
rio de Ciencia e Innovation, and Programa Consolider- 
Ingenio 2010, Spain; the Slovak R&D Agency; and the 
Academy of Finland. 



[1] A. Sfyrla, Ph.D thesis, University of Geneva (2008) 

(FERMILAB-THESIS-2008-30) . 
[2] K. Hagiwara et al, Nucl. Phys. B282, 253 (1987). 
[3] A. Bhatti et al., Nucl. Instrum. Methods Phys. Res., Sect. 

A566, 375 (2006). 
[4] W.-M.Yao et al, J. Phys. G 33, 1 (2006). 
[5] J.M. Campbell and R.K. Ellis, Phys. Rev. D 60, 113006 

(1999). 

[6] V. M. Abazov et al (DO Collaboration), 

arXiv:0810.3873vl [hep-ex]. 
[7] T. Aaltonen et al. (CDF Collaboration), Phys. Rev. D 

77, 011108 (2008). 
[8] D. Acosta et al, Phys. Rev. D 71, 032001 (2005) 
[9] We use a cylindrical coordinate system along the proton 

direction in which 8 (<f>) is the polar (azimuthal) angle. 

We define r] = — ln(tan(6 l /2)), pr = psin6<, Et = Esm8, 

$ T = — | J>\ E T fiT,i\, where n,T,i is a unit vector in the 

transverse plane that points from the beamline to the 

i th calorimeter tower, and Mr = [2 • pr • ' (1 

cfir- A<j>(lepton, $ T )\ 1/2 . 
[10] T. Sjostrand, S. Mrenna, and P. Skands, J. High Energy 

Phys. 0605 (2006) 026. 
[11] D. Acosta et al. (CDF Collaboration), Phys. Rev. D 70, 

072002 (2004). 

[12] F. Abe et al. (CDF Collaboration), Phys. Rev. D 45, 
1448 (1992). 

[13] M. L. Mangano et al, J. High Energy Phys. 07 (2003) 
001. 



9 

[14] Neural Networks for Pattern Recognition, Christopher 
M. Bishop, Oxford Science Publications. 

[15] CDF Statistics Committee Recommendations, Neu- 
ral Networks: http://www-cdf.fnal.gov/physics/ 
statistics/recommendations/selection. html#net 

[16] Statistical Data Analysis, Chapter 9.3: Confidence inter- 
val for a Gaussian distributed estimator, Glen Cowan, 
Oxford Science Publications. 



